Goto

Collaborating Authors

 sample analysis


Reviews: Finite sample analysis of the GTD Policy Evaluation Algorithms in Markov Setting

Neural Information Processing Systems

It is well known that the standard TD algorithm widely used in reinforcement learning does not correspond to the gradient of any objective function, and consequently is unstable when combined with any type of function approximation. Despite the success of methods like deep RL, which combines vanilla TD with deep learning, theoretically TD with nonlinear function approximation is demonstrably unstable. Much work on fixing this fundamental flaw in RL has been in vain, till the work on gradient TD methods by Sutton et al. Unfortunately, these methods work, but their analysis was flawed, based on a heuristic derivation of the method. A recent breakthrough by Liu et al. (UAI 2015) showed that gradient TD methods are essentially saddle point methods that are pure gradient methods that optimize not the original gradient TD loss function (which they do not), but rather the saddle point loss function that arises when converting the original loss function into the dual space.


Getting lost in space: Large sample analysis of the resistance distance

Neural Information Processing Systems

The commute distance between two vertices in a graph is the expected time it takes a random walk to travel from the first to the second vertex and back. We study the behavior of the commute distance as the size of the underlying graph increases. We prove that the commute distance converges to an expression that does not take into account the structure of the graph at all and that is completely meaningless as a distance function on the graph. Consequently, the use of the raw commute distance for machine learning purposes is strongly discouraged for large graphs and in high dimensions. As an alternative we introduce the amplified commute distance that corrects for the undesired large sample effects.


A Multi Clustered approach for Predicting Covid-19 Spread

#artificialintelligence

As vaccine production & procurement processes are ramping up, the distribution of vaccines is a thing of concern. As large amount vaccine units roll out, the first step is strategic & wise distribution among regions, considering conducive & causative factors raising the urgency of requirements. For this, organizations & governments may look upon the predictive suggestions [1] backed by data to chart out further plans. Many countries, particularly those in the developing world, where governments are struggling to procure vaccines to vaccinate their residents. One of the decisions to be taken strategically is the wise & calculated distribution of the vaccine received. Although few countries are actively trying to increase vaccination rates, the overall 56% world population is yet to take their first vaccine dose.


Getting lost in space: Large sample analysis of the resistance distance

Luxburg, Ulrike V., Radl, Agnes, Hein, Matthias

Neural Information Processing Systems

The commute distance between two vertices in a graph is the expected time it takes a random walk to travel from the first to the second vertex and back. We study the behavior of the commute distance as the size of the underlying graph increases. We prove that the commute distance converges to an expression that does not take into account the structure of the graph at all and that is completely meaningless as a distance function on the graph. Consequently, the use of the raw commute distance for machine learning purposes is strongly discouraged for large graphs and in high dimensions. As an alternative we introduce the amplified commute distance that corrects for the undesired large sample effects.